. CR»ror3 Corrected by the STIC Syst^3 Branch ^ l VS/T? 

/ 0 /M? W*fs> <T/4 ~RF Processing \X>*\o+J^tt ° 

. I OfUH^ } u?G?J>^ Edited bv:/ V\ ^ 



Sorter Numbor; 



□ Changed a file from non-ASCII to ASCII Y^^y : — — < ST| C staff) 

j "] Changed the margins in cases where the sequence ^V"^ ^P^fl^wro the next line. 
| | Edited a format error in the Current Application {^JlbCTon, specifically: 

| | Edited the Current Application Data section with the actual current number^ The number inputted by the 
applicant was □ the prior applfcation data; or □ other .[ . 

| | Added the mandatory heading and subheadings for 'Current Application Data". 

| | Edited the 'Number of Sequences" field. The applicant spelled out a number instead oftjsing an integer. 
| | Changed the spoiling of a mandatory field (the headings or subheadings), specifically: 

I I Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 

| ) Inserted or corrected a nucleic number at the end of a nucleic line. SEQ ID NCs edited: 

I ] Corrected subheading placement. All responses must be on the same line as each subheading. If the 

applicant placed a response below the subheading, this was moved to its appropriate place. 

| | Inserted colons after headings/subheadings. Headings edited included: 

j j Deleted extra, invalid, headings used by an applicant, specifically: 

Deleted: Q^orvASCII •garbage" at the beginnino^e^})f files; □ secretary initials/filename at end of file; 
□ page numbers throughout text; □ other invafotflext, such as ; . 

. [„. | .Inserted mandatory headings, specifically: : 




| | Corrected an obvious error in the response, specifically: 



| | Edited identifiers where upper case is used but lower case is required, or vice versa. 

1 | Corrected an error in the Number of Sequences field, specifically: 



J^Ji A 'Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted. 

| | Deleted ending stop codon in amino add sequences and adjusted the "(A)Length:* field accordingly (error 
due to a Patentln bug). Sequences corrected: ; 



I I Other: 



'Examiner: The above corrections must be communicated to the applicant In the first Office 
Action. DO NOT send a copy of this form. - 3/1/95 



RAW SEQUENCE LISTING DATE: 07/11/2002 

PATENT APPLICATION: US/10/042 , 665A TIME: 09:11:59 



Input Set : A:\ptoms.txt 

Output Set: N:\CRF3\07112002\J042665A.raw 



3 <110> APPLICANT: Schupp, Thomas 

4 Toupet, Christine 

5 Engel, Nathalie 

7 <120> TITLE OF INVENTION: Rifamycin biosynthesis gene cluster 
9 <130> FILE REFERENCE: 4-21001/B/C1 

11 <140> CURRENT APPLICATION NUMBER: 10/042, 665A 

12 <141> CURRENT FILING DATE: 2002-01-09 

14 <150> PRIOR APPLICATION NUMBER: 09/242,744 

15 <151> PRIOR FILING DATE: 1999-03-24 

17 <150> PRIOR APPLICATION NUMBER: PCT/EP97/04495 

18 <151> PRIOR FILING DATE: 1997-08-18 
20 <160> NUMBER OF SEQ ID NOS : 9 

22 <170> SOFTWARE: Patentln Ver . 2.1 

24 <210> SEQ ID NO: 1 

25 <211> LENGTH: 5676 

26 <212> TYPE: DNA 

27 <213> ORGANISM: Amcylatopsis mediterranei 

29 <400> SEQUENCE: 1 

30 ggtacccggt gttcgcgacg gcgttcgacg aggcttgcga gcagctggac gtctgtctgg 60 

31 ccggccgtgc cgggcaccgc gtgcgggacg tcgtgctcgg cgaagtgccc gccgaaaccg 120 

32 ggctgctgaa ccagacggtc ttcacccaag ccgggctgtt cgcggtggag agcgcgctgt 180 

33 tccggctcgc cgaatcctgg ggtgtccggc cggacgtggt gctcggccac tccatcgggg 240 

34 agatcaccgc cgcgtatgcc gcgggcgtct tctcgctgcc ggacgccgcc cggatcgtcg 300 

35 cggcgcgcgg ccggctgatg caggcgctgg cgccgggcgg ggcgatggtc gccgtcgccg 360 

36 cctccgaagc cgaggtggcc gaactgctcg gcgacggcgt ggaactcgcc gccgtcaacg 420 

37 gcccttcggc ggtagtcctt tccggggacg cggacgcggt cgtcgcggcc gccgcccgca 480 

38 tgcgcgagcg cgggcacaag accaagcagc tcaaggtttc gcacgcgttc cactccgcgc 540 

39 ggatggcgcc gatgctggcg gagttcgccg ccgagctggc cggcgtgacg tggcgcgagc 600 

40 cggagatccc ggtggtctcc aacgtgaccg gccggttcgc cgagcccggc gaactgaccg 660 

41 agccgggcta ctgggccgag cacgtgcggc ggccggtgcg gttcgccgag ggcgtcgcgg 720 

42 ccgcgacgga gtccggcggc tcgctgttcg tggagctcgg gccgggggcg gcgctgaccg 780 

43 ccctcgtcga ggagacggcc gaggtcacct gcgtcgcggc cctgcgggac gaccgcccgg 840 

44 aggtcaccgc gctgatcacc gcggtcgccg agctgttcgt ccgcggggtt gcggtcgatt 900 
4 5 ggccggccct gctgccgccg gtcaccgggt tcgtcgacct gccgaagtac gccttcgacc 960 
46 agcagcacta ttggctgcag cccgccgcgc aggccacgga cgcggcctcg ctcgggcagg 1020 
4 7 tcgcggccga ccacccgctg ctgggcgcgg tggtccggct gccgcagtcg gacggcctgg 1080 

48 tcttcacctc gcggctgtca ttgaaatcgc acccgtggct ggccgaccac gtcatcggcg 1140 

49 gggtggtgct cgtcgcgggc accgggctcg tcgagctggc cgtccgggcc ggggacgagg 1200 

50 ccggctgccc ggtcctcgaa gaactcgtca tcgaggctcc gctggtcgtc cccgaccacg 1260 

51 gcggggtccg gatccaggtc gtcgtggggg caccggggga gaccggttcg cgcgcggtcg 1320 

52 aggtgtactc cctgcgcgag gacgccggtg ccgaagtgtg ggcccggcac gccaccgggt 1380 

53 tcctggctgc gacgccgtcg cagcacaagc cgttcgactt caccgcctgg ccgccgcccg 1440 

54 gcgtcgagcg cgtcgacgtc gaggacttct acgacggctt cgtcgaccgc gggtacgcct 1500 
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55 acgggccgtc gttccggggc ctgcgggcgg tgtggcggcg cggcgacgaa gtgttcgccg 1560 

56 aggtcgccct ggccgaggac gaccgcgcgg acgcggcccg gttcggcatc caccccggcc 1620 

57 tgctggacgc cgccctgcac gcgggcatgg ccggtgccac caccacggaa gagcccggcc 1680 

58 ggccggtgct gccgttcgcc tggaacggcc tggtgctgca cgcggccggg gcgtccgcgc 1740 

59 tgcgggtccg gctcgccccg agcggtccgg acgccctgtc ggtcgaggcc gcggacgagg 1800 

60 ccggcggtct cgttgtgacg gcggactcgc tggtctcccg gccggtgtcg gccgaacagc 1860 

61 tgggcgcggc ggcgaaccac gacgcgttgt tccgcgtgga gtggaccgag atttcctcgg 1920 

62 ctggagacgt tccggcggac cacgtcgaag tgctcgaagc cgtcggcgag gatcccctgg 1980 

63 aactgaccgg ccgggtcctg gaggccgtgc agacctggct cgccgacgca gccgacgacg 2040 

64 ctcgcctggt cgtggtgacc cgcggcgccg tccacgaggt gactgacccg gccggtgccg 2100 

65 cggtgtgggg cctgatccgg gccgcgcagg cggaaaaccc ggaccggatc gtgctgctgg 2160 

66 acaccgacgg tgaagtgccg ctaggccggg tgctggccac cggcgagccc caaacagccg 2220 

67 tccgaggcgc cacgctgttc gccccgcggc tggcccgcgc cgaggccgcg gaggcaccgg 2280 

68 cagtgaccgg cgggacggtc ctgatctcgg gcgccggctc gctgggcgcg ctcaccgccc 2340 

69 ggcacctggt cgcccggcac ggagtccggc ggctggtgct cgtcagccgc cgtggccccg 2400 

70 acgccgacgg catggccgaa ctgaccgctg aactcatcgc tcagggcgcc gaggtcgccg 2460 

71 tagtcgcttg cgacctggcc gaccgggacc aggtccgggt actgctggcc gagcaccgcc 2520 

72 cgaacgccgt cgtgcacacg gccggtgttc tcgacgacgg cgtcttcgag tcgctgacgc 2580 

73 gggagcggct ggccaaggtc ttcgcgccca aagttactgc tgccaatcac ctcgacgagc 2640 

74 tgacccgcga actggatctt cgcgcgttcg tcgtgttctc ctccgcctcc ggggtcttcg 2700 

75 gctccgccgg gcagggcaac tacgccgctg ccaacgccta cctggacgcc gtggtcgcca 2760 

76 accgccgggc cgcgggcctg cccggcacat cgctggcctg gggcctgtgg gaacagaccg 2820 

77 acgggatgac cgcgcacctc ggcgacgccg accaggcgcg ggcgagtcgc ggcggggtcc 2880 

78 tcgccatctc acccgccgaa ggcatggagc tgttcgacgc agcgccggac gggctcgtcg 2940 

79 tcccggtcaa gctggacctg cgcaagaccc gcgccggcgg gacggtgccg cacctgctgc 3000 

80 gcggcctggt ccgcccggga cggcagcagg cccgtccggc gtccactgtg gacaacggac 3060 

81 tggccgggcg actcgccggg ctcgcgccgg cggagcagga ggcgctgctg ctcgacgtcg 3120 

82 tccgcacgca ggtcgcgctg gtgctcgggc acgccgggcc ggaggccgtc cgcgcggaca 3180 

83 cggcgttcaa ggacaccggc ttcgactcgc tgacgtcggt ggaactgcgc aaccggctgc 3240 

84 gcgaggcgag cgggctgaag ctgcccgcga cgctcgtctt cgactacccg acgccggtcg 3300 

85 cgctggcccg ctacctgcgt gacgaattcg gcgacacggt ggcaacaact ccggtggcca 3360 

86 ccgcggccgc agcggacgcc ggcgagccga tcgccatcgt cggcatggcg tgccggctgc 3420 

87 cgggcggggt caccgatccc gaaggcctgt ggcgcctggt gcgcgacggc ctcgaagggc 3480 

88 tgtctccctt ccccgaggac cggggctggg acctggagaa cctgttcgac gacgaccccg 3540 

89 accgctccgg cacgacgtac accagccggg gcgggttcct cgacggcgcc ggcctgttcg 3600 

90 acgcgggctt cttcgggatt tcgccgcgcg aggcgctggc catggacccg cagcagcggc 3 660 

91 tgctgctcga ggcggcctgg gaagccctcg aaggcaccgg tgtcgacccg ggctcgttga 3720 

92 agggcgccga cgtcggggtg ttcgccgggg tgtccaacca gggctatggg atgggcgcgg 3780 

93 atccggccga actggcgggg tacgcgagca cggcgggcgc ttcgagcgtc gtctcgggcc 384 0 

94 gagtctcgta cgtcttcggg ttcgaaggac cggcggtcac gatcgacacg gcttgctcgt 3900 

95 cgtcgctggt ggcgatgcac ctggccgggc aggcgctgcg gcagggcgag tgctcgatgg 3960 

96 ccctggccgg tggcgtcacg gtgatgggga cgcccggcac gttcgtggag ttcgcgaagc 4020 

97 agcgcggcct ggccggcgac ggccggtgca aggcctacgc cgaaggcgcg gacggcacgg 4080 

98 gctgggccga gggcgtcggg gtcgtcgtgc tggagcggct gtcggtggcg cgcgagcgcg 4140 

99 ggcaccgggt gctggccgtg ctgcgcggca gcgcggtcaa ctccgacggc gcgtccaacg 4200 

100 gcctgaccgc ccccaacggg ccgtcgcagc aacgggtgat ccgccgggcc ctggccggcg 4260 

101 ccggcctcga accgtccgat gtggacatcg tggaagggca cggcaccggg acggcgctgg 4320 

102 gcgacccgat cgaggcgcag gccctgctgg ccacctacgg caaggaccgc gacccggaga 4380 

103 cgccgttgtg gctggggtcg gtgaagtcga acttcggcca cacgcagtcc gcggccggcg 4440 
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104 tggccggggt gatcaagatg gtgcaggcgc tgcgccacgg cgtcatgccg 

105 acgtggaccg gcccaccagc caggtcgact ggtccgcggg ggccgtcgaa 

106 aggcacggga gtggccgcgg aacggccgtc cgcgccgggc cggggtgtcc 

107 tcagcggcac gaacgcccac ctgatcatcg aagaagcacc ggccgagcca 

108 gaccaccgcc ggacggcggt gtggtgccgc tggtcgtctc ggctcgcagc 

109 tggccggtca ggcgcgtcgg ctggccacgt tcctcggcga cgggcccctt 

110 ccggtgcgct gacgagccgc gccctgttcg gcgagcgcgc ggtcgtcgtg 

111 ccgaggaagc ccgcgccggt ctgggcgcac tggcccgcgg cgaagacgcg 

112 tccgcggccg ggtgcccgcg tccggcctgc cgggcaagct cgtgtgggtg 

113 aggggacgca gtgggtgggc atgggccgcg aactcctcga agagtctccg 

114 agcggatcgc cgagtgtgcg gccgcgctgg agccgtggat cggctggtcg 

115 tcctccgtgg cgacggtgac ctcgatcggg tcgatgtgct gcagcccgcg 

116 tgatggtcgg cttggccgcg gtgtggtcct cggccggggt ggtccccgat 

117 gccactccca gggtgagatc gccgcggcgt gcgtgtcggg tgcgttgtcg 

118 cggcgaaggt ggttgccctg cgcagccagg ccatcgccgc gaagctctcc 

119 ggatggcttc ggtcgccttg ggcgaagccg atgtggtgtc gcggctggcg 

120 aggtggctgc cgtcaacggt ccggcgtccg tggtgatcgc gggggatgcc 

121 acgaaacgct ggaagcgctg tccggtgcgg gaatccgggc tcggcgggtg 

122 acgcctcgca cacccggcac gtcgaagaca tcgaagacac cctcgccgaa 

123 ggatcgacgc ccgggcgccg ctggtgccgt tcctctccac cctcaccggc 

124 gggacgaggg cgtcgtggac ggcggctact ggtacc 

127 <210> SEQ ID NO: 2 

128 <211> LENGTH: 1875 

129 <212> TYPE: PRT 

130 <213> ORGANISM: Amycolatopsis mediterranei 
132 <400> SEQUENCE: 2 
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cccaccctgc 4500 
gtgctgaccg 4560 
tcgttcggga 4620 
cagcttgccg 4680 
cccggtgccc 4740 
tccgacgtcg 4800 
gcggattcgg 4860 
ccgggcctgg 4920 
ttccccgggc 4980 
gtgttcgccg 5040 
ctgttcgacg 5100 
tgctttgcgg 5160 
gcggtgctcg 5220 
ctggaggatg 5280 
ggccgcggcg 5340 
gacggggtcg 5400 
caggccctcg 5460 
gcggtggact 5520 
gcgctggccg 5580 
gagtggatcc 5640 
5676 
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167 



180 



185 



190 



169 Ala Gly Val Thr Trp Arg Glu Pro Glu lie Pro Val Val Ser Asn Val 

170 195 200 205 

172 Thr Gly Arg Phe Ala Glu Pro Gly Glu Leu Thr Glu Pro Gly Tyr Trp 

173 210 215 220 

175 Ala Glu His Val Arg Arg Pro Val Arg Phe Ala Glu Gly Val Ala Ala 

176 225 230 235 240 

178 Ala Thr Glu Ser Gly Gly Ser Leu Phe Val Glu Leu Gly Pro Gly Ala 

179 245 250 255 

181 Ala Leu Thr Ala Leu Val Glu Glu Thr Ala Glu Val Thr Cys Val Ala 

182 260 265 270 

184 Ala Leu Arg Asp Asp Arg Pro Glu Val Thr Ala Leu lie Thr Ala Val 

185 275 280 285 

187 Ala Glu Leu Phe Val Arg Gly Val Ala Val Asp Trp Pro Ala Leu Leu 

188 290 295 300 

190 Pro Pro Val Thr Gly Phe Val Asp Leu Pro Lys Tyr Ala Phe Asp Gin 

191 305 310 315 320 

193 Gin His Tyr Trp Leu Gin Pro Ala Ala Gin Ala Thr Asp Ala Ala Ser 

194 - ' 325 330 335 

196 Leu Gly Gin Val Ala Ala Asp His Pro Leu Leu Gly Ala Val Val Arg 

197 340 345 350 

199 Leu Pro Gin Ser Asp Gly Leu Val Phe Thr Ser Arg Leu Ser Leu Lys 

200 355 360 365 

202 Ser His Pro Trp Leu Ala Asp His Val lie Gly Gly Val Val Leu Val 

203 370 375 380 

205 Ala Gly Thr Gly Leu Val Glu Leu Ala Val Arg Ala Gly Asp Glu Ala 

206 385 390 395 400 

208 Gly Cys Pro Val Leu Glu Glu Leu Val lie Glu Ala Pro Leu Val Val 

209 405 410 415 

211 Pro Asp His Gly Gly Val Arg He Gin Val Val Val Gly Ala Pro Gly 

212 420 425 430 

214 Glu Thr Gly Ser Arg Ala Val Glu Val Tyr Ser Leu Arg Glu Asp Ala 

215 435 440 445 

217 Gly Ala Glu Val Trp Ala Arg His Ala Thr Gly Phe Leu Ala Ala Thr 

218 450 455 460 

220 Pro Ser Gin His Lys Pro Phe Asp Phe Thr Ala Trp Pro Pro Pro Gly 

221 465 470 475 480 

223 Val Glu Arg Val Asp Val Glu Asp Phe Tyr Asp Gly Phe Val Asp Arg 

224 485 490 495 

22 6 Gly Tyr Ala Tyr Gly Pro Ser Phe Arg Gly Leu Arg Ala Val Trp Arg 
227 500 505 510 

229 Arg Gly Asp Glu Val Phe Ala .Glu Val Ala Leu Ala Glu Asp Asp Arg 

230 515 520 525 

232 Ala Asp Ala Ala Arg Phe Gly He His Pro Gly Leu Leu Asp Ala Ala 

233 530 535 540 

235 Leu His Ala Gly Met Ala Gly Ala Thr Thr Thr Glu Glu Pro Gly Arg 

236 545 550 555 560 

238 Pro Val Leu Pro Phe Ala Trp Asn Gly Leu Val Leu His Ala Ala Gly 

239 565 570 575 
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